Closes #22. Added a test that the memory usage doesn't balloon. by rhliang · Pull Request #23 · cfe-lab/hla_algorithm

rhliang · 2026-04-28T23:34:17Z

Also added some other supporting code, like a CI workflow to run the test on a schedule, and also the one-off script that I used to help measure resource consumption of the code.

codecov · 2026-04-28T23:35:09Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 100.00%. Comparing base (d4366d9) to head (776d5b0).
✅ All tests successful. No failed tests found.

Additional details and impacted files

@@            Coverage Diff            @@
##              main       #23   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files            5         5           
  Lines          808       803    -5     
  Branches       117       116    -1     
=========================================
- Hits           808       803    -5

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

…ully).

va7eex

I have some suggestions that are completely optional, otherwise I approve.

va7eex · 2026-05-05T18:13:28Z

+    args = parser.parse_args()
+
+    resource_summaries: list[ResourceSummary] = []
+    sample_regex = re.compile(r"^.*/(.*)\.BA\.txt$")


I'd probably add this regex to the other regexes defined above.

va7eex · 2026-05-05T18:20:51Z

+    parser.add_argument("input_dir", help="Directory to scan for HLA sequences")
+    parser.add_argument("--output_csv", help="CSV file summary", default="out.csv")


I would recommend these be both of type Path

https://docs.python.org/3/library/argparse.html#type

You could be a bit more explicit and type the directory as "a directory", see this example: https://stackoverflow.com/a/51212150

va7eex · 2026-05-05T18:29:26Z

+    for exon1_filename in glob.glob(f"{args.input_dir}/*.BA.txt"):
+        sample_name: str = sample_regex.match(exon1_filename).group(1)
+        exon2_filename: str = os.path.join(args.input_dir, f"{sample_name}.BB.txt")
+        with open(exon1_filename) as f:
+            exon1: str = f.read().strip()
+        with open(exon2_filename) as f:
+            exon2: str = f.read().strip()


Typing with Path, this could become

for exon1_filepath in args.input_dr.glob("*.BA.txt"): sample_name: str = sample_regex.match(exon1_filepath.name).group(1) exon2_filepath: Path= exon1_filepath.with_name(exon1_filepath.name.replace("BA.txt", "BB.txt")) exon1 = exon1_filepath.read_text().strip() exon2 = exon2_filepath.read_text().strip() ... json_filepath = args.input_dir / f"{sample_name}.json" json_filepath.write_text(json.dumps(json_input)) ... result = subprocess.run( [ ..., json_filepath.as_posix(), ]

Using David's suggestion from review. Co-authored-by: David Rickett <25559687+va7eex@users.noreply.github.com>

Closes #22. Added a test that the memory usage doesn't balloon.

95250c3

Also added some other supporting code, like a CI workflow to run the test on a schedule, and also the one-off script that I used to help measure resource consumption of the code.

rhliang self-assigned this Apr 28, 2026

rhliang requested a review from va7eex April 28, 2026 23:34

rhliang added this to the v1.1 milestone Apr 28, 2026

Fixed some syntax in the acceptable memory usage test workflow (hopef…

482e3ab

…ully).

va7eex approved these changes May 5, 2026

View reviewed changes

rhliang and others added 2 commits May 5, 2026 15:49

Update src/scripts/measure_resources.py

97b17d6

Using David's suggestion from review. Co-authored-by: David Rickett <25559687+va7eex@users.noreply.github.com>

Incorporated more suggestions from review.

776d5b0

va7eex approved these changes May 6, 2026

View reviewed changes

rhliang merged commit 3be2bcb into main May 6, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Closes #22. Added a test that the memory usage doesn't balloon.#23

Closes #22. Added a test that the memory usage doesn't balloon.#23
rhliang merged 4 commits into
mainfrom
MemoryOptimization

rhliang commented Apr 28, 2026

Uh oh!

codecov Bot commented Apr 28, 2026 •

edited

Loading

Uh oh!

va7eex left a comment

Uh oh!

Uh oh!

va7eex May 5, 2026

Uh oh!

va7eex May 5, 2026

Uh oh!

va7eex May 5, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		parser.add_argument("input_dir", help="Directory to scan for HLA sequences")
		parser.add_argument("--output_csv", help="CSV file summary", default="out.csv")

Conversation

rhliang commented Apr 28, 2026

Uh oh!

codecov Bot commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

va7eex left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

va7eex May 5, 2026

Choose a reason for hiding this comment

Uh oh!

va7eex May 5, 2026

Choose a reason for hiding this comment

Uh oh!

va7eex May 5, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov Bot commented Apr 28, 2026 •

edited

Loading